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Overall figure of merit (before solvent flattening): 0.68 



Refinement statistics 



Noa- hydrogen Water Resolution Reflections R.m.s. deviations 

Crystal protein atoms s Ions (A) total R^ 6 R^ 6 bonds (A)(°) angles 



IQN17/D10 516 150 1 10.0-1.5 13549 0.214 0.245 0.012 1.498 

IQN17 1143 160 1 5.0-2.5 7541 0.282 0.352 0.009 1.252 



^yn, = ZZj|lj - <l>|/22:j|<l>| ) where !j is tie recorded intensity of the reflection j and <1> is the mean recorded 
intensity over multiple recordings. . 
2 R iso - W ( u) ± - |F<w)llA:l F (M>l> waere F (X0 * the structure factor at wavelength hi and F (W) is the structure 
factor at the reference wavelength M. 

3 R culllB = 2||F W) x F^| - IF^JI/ZIF^ * F^l, where is the calculated heavy atom structure factor. 

"Phase power = <F h(X0 >/E, where <F h(w) >is the root-mean-square heavy atom structure factor and E is the residual 
lack of closure error. 

5 Occupancies are values output from MLPHARE. 

6 Rdy«i, fi« = EUFobJ - iF^JI/lFob.!, where the crystallographic and free R factors are calculated using the working 
and test sets, respectively. Test set contained 10% of reflections. 



[0242] While this invention has been particularly shown 
and described with references to preferred embodiments 
thereof, it will be understood by those skilled in the art that 



various changes in form and details may be made therein 
without departing from the spirit and scope of the invention 
as defined by the appended claims. 



SEQUENCE LISTING 



<160> NUMBER OF SEQ ID NOS: 68 

<210> SEQ ID NO 1 
<211> LENGTH: 33 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<2 23> OTHER INFORMATION: GCN4-PIQI 
<4 00> SEQUENCE: 1 

Arg Met Lys Gin lie Glu Asp Lye lie Glu Glu He Leu Ser Lys Gin 
15 10 15 

Tyr Hie He Glu Asn Glu He Ala Arg He Lye Lye Leu He Gly Glu 

20' 25 30 

Arg 



<210> SEQ ID NO 2 
<211> LENGTH: 45 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<2 20> FEATURE: 

<2 23> OTHER INFORMATION: IQN17 
<4 00> SEQUENCE: 2 



Arg Met Lys Gin He Glu Asp Lys He Glu Glu He Glu Ser Lys Gin 
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-continued 



Lys Lys lie Glu Aan Glu lie Ala Arg lie Lys Lys Leu Leu Gin Leu 

20 25 30 

Thr Vol Trp Gly lie Lys Gin Leu Gin Ala Arg lie Leu 
35 40 45 



<210> SEQ ID NO 3 

<211> LENGTH : 12 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 

<400> SEQUENCE: 3 

Cye Asp Leu Lys Ala Lys Glu Trp Phe Trp Leu Cys 
1 5 10 



<210> SEQ ID NO 4 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 



<400> SEQUENCE: 4 

Cys Glu Ala Arg His Arg Glu Trp Ala Trp Leu Cys 
1 5 10 



<210> SEQ ID NO 5 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : D-peptide 
<4 00> SEQUENCE: 5 

Cys Glu Leu Leu Gly Trp Glu Trp Alo Trp Leu Cys 
15 10 



<210> SEQ ID NO 6 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 

<400> SEQUENCE: 6 

Cys Leu Leu Arg Ala Pro Glu Trp Gly Trp Leu Cys 
1 5 10 



<210> SEQ ID NO 7 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 7 

Cys Ser Arg Ser Gin Pro Glu Trp Glu Trp Leu Cys 
15 10 



<210> SEQ ID NO 8 
<:211> LENGTH: 12 



WEST 
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-continued 



<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE I 

<223> OTHER INFORMATION i D-peptide 
<400> SEQUENCE: 8 

Cys Gly Leu Gly Gin Glu Glu Trp Phe Trp Leu Cys 
1 5 10 



<2i0> SEQ ID NO 9 
<211> LENGTH i 12 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE i 

<223> OTHER INFORMATION: D-peptide 
<4 00> SEQUENCE : 9 

Cys Met Arg Gly Glu Trp Glu Trp Ser Trp Leu Cys 
15 10 



<210> SEQ ID NO 10 
<2U> LENGTH: 12 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE t 

<223> OTHER INFORMATION : D-peptide 
<400> SEQUENCE: 10 

Cyo Pro Pro Leu Asn Lys Glu Trp Ala Trp Leu Cya 
15 10 



<210> SEQ ID NO 11 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<223> OTHER INFORMATION: D-peptide 
<4 00> SEQUENCE: 11 

Cya Val Leu Lys Ala Lyo Glu Trp Phe Trp Leu Cyo 
15 10 



<210> SEQ ID NO 12 

<21i> LENGTH: 11 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE t 

<223> OTHER INFORMATION: D-peptide 

<221> NAME/KEY: VARIANT 

<222> LOCATION: (1)...(11) 

<223> OTHER INFORMATION: Xaa - Any Amino Acid 

<400> SEQUENCE : 12 

Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu 

1 5 10 



<210> SEQ ID NO 13 

<21l> LENGTH t 35 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: N36 

<400> SEQUENCE i 13 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Gin 



WEST 



US 2002/0077284 Al Jun. 20, 2002 

27 



-continued 



15 10 15 

Gin Hie Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala 
20 25 30 

Arg lie Leu 
35 



<210> SEQ ID NO 14 
<21l> LENGTH l 34 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220;> FEATURE: 

<223> OTHER INFORMATION: C34 
<4 00> SEQUENCE i 14 

Trp Met: Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu lie His 
15 10 15 

Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lyo Aon Glu Gin Glu 
20 25 30 

Leu Leu 



<210> SEQ ID NO 15 
<211> LENGTH i 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 15 

Lys Lys Gly Ala Cys Gly Leu Gly Gin Glu Glu Trp Phe Trp Leu Cys 
15 10 15 



<210> SEQ ID NO 16 
<2U> LENGTH : 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<4 00> SEQUENCE: 16 

Lye Lys Gly Ala Cys Glu Leu Leu Gly Trp Glu Trp Ala Trp Leu Cys 
15 10 15 



<210> SEQ ID NO 17 
<2U> LENGTH x 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 17 

Lys Lys Lys Lys Gly Ala Cys Glu Leu Leu Gly Trp Glu Trp Ala Trp 

15 10 15 

Leu Cys 



<210> SEQ ID NO 18 

<211> LENGTH: 16 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE t 

<223> OTHER INFORMATION: D-peptide 



<400> SEQUENCE: 18 
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-continued 



Ly. Ly. Gly Al. Cy. Me* Arg Gly Glu Trp Glu Trp Ser Trp Leu cy. 



1 



<210> SEQ ID NO 19 
<211> LENGTH; 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 19 

Ly. Ly. Gly Al. Cy. Pro Pro Leu A.n Ly. Glu Trp Al. Trp Leu Cy. 
Ala Ala 

<210> SEQ ID NO 20 
<211> LENGTH: 17 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: HIV-1 Residues 
<400> SEQUENCE: 20 

Leu Leu Gin Leu Thr V.1 Trp Gly lie Ly. Gin Leu Gin Ala Arg II. 
1 5 10 

Leu 

<210> SEQ ID NO 21 
<211> LENGTH I 24 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: , M _ ^-—ina! End of N26 

<223> OTHER INFORMATION: 24 Reaiduea from the U- Terminal 

<400> SEQUENCE: 21 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 5 10 

Gin Gin His Leu Leu Gin Leu Thr 
20 

<210>'SEQ ID NO 22 
<211> LENGTH: 55 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: IQN24n 
<400> SEQUENCE: 22 

„. t Arg Hat Ly. Gin He Glu A.p Ly. lie Glu Glu IU Glu Ser Ly. 

1 » 10 

CI. Ly. Ly. II. Glu A.n Glu lie Al. Arg lie Ly. Ly. Leu He Ser 

20 25 
Oly Ila V.1 Gin Gin Gin A.n A.n Leu Leu Arg Al. lie Glu Al. Gin 

35 40 
Gin His Leu Leu Gin Leu Thr 



50 



55 



<210> SEQ ID NO 23 
<211> LENGTH: 4 
<:212> TYPE: PRT 
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<213> ORGANISM: Artificial Sequence 
<22D> FEATURE : 

<223> OTHER INFORMATION i D-peptide 
<221> NAME /KEY: VARIANT 
<222> LOCATION: (L)...(4) 

<223> OTHER INFORMATION: Xaa = Any Amino Acid 

<400> SEQUENCE: 23 

Trp Xaa Trp Leu 
1 



<210> SEQ ID NO 24 

<211> LENGTH: 5 

<212> TYPE : PRT 

<213> ORGANISMS Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1)...(5) 

<223> OTHER INFORMATION: Xaa - Any Amino Acid 

<400> SEQUENCE: 24 

Glu Trp Xaa Trp Leu 
1 5 



<210> SEQ ID NO 25 

<211> LENGTH: 28 

<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Soluble, Triineric Version of the Coiled Coil 
Region fo GCN4 in IQN17 

<400:> SEQUENCE: 25 

Arg Met Lys Gin He Glu Asp Lys He Glu Glu He Glu Ser Lys Gin 
15 10 15 

Lys Lys He Glu Asn Glu He Ala Arg He Lye Lye 
20 25 



<210> SEQ ID NO 26 
<2U> LENGTH: 17 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: HIV-2 Sequence 
<400> SEQUENCE: 2 6 

Leu Leu Arg Leu Thr Val Trp Gly Thr Lys Asn Leu Gin Ala Arg Val 
15 10 15 

Thr 



<210> SEQ ID NO 27 
<2U> LENGTH: 17 
<212> TYPE: PRT 

<213? ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: SIV Sequence 
<400> SEQUENCE: 27 

Leu Leu Arg Leu Thr Val Trp Gly Thr Lys Asn Leu Gin Thr Arg Val 
15 10 15 



Thr 
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<210> SEQ ID NO 28 
<211» LENGTH: 16 
<212> TYPE I PRT 

<213> ORGANISM i Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<221> NAME /KEY: VARIANT 
<222> LOCATION: ( 1) . . . ( 16 > 

<223> OTHER INFORMATION: Xaa - Any Amino Acid 
<40D> SEQUENCE: 28 

Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cye Xaa Xaa 
15 10 15 



<210> SEQ ID NO 29 
<211> LENGTH: 18 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION i D-peptide 
<221> NAME /KEY: VARIANT 
<222> LOCATION: (I).. .(18) 

<22 3> OTHER INFORMATION: Xaa = Any Amino Acid 
<40 0> SEQUENCE: 29 

Lye Lye Xaa Xaa Cya Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cye 
15 10 15 

Xaa Xaa 



<210> SEQ ID NO 30 
<211> LENGTH: 20 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<221> NAME/KEY: VARIANT 
<222> LOCATION: (I)... (20) 

<223> OTHER INFORMATION : Xaa - Any Amino Acid 
<400> SEQUENCE: 30 

Lyo Lye Lyo Lyo Xaa Xaa Cya Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp 
15 10 15 

Leu Cys Xaa Xaa 

20 



<210> SEQ ID NO 31 

<211> LENGTH: 17 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (1)...(17) 

<223> OTHER INFORMATION: Xaa = Any Amino Acid 

<400> SEQUENCE: 31 

Xsa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cys Xaa Xaa 
15 10 15 

Xaa 



<210> SEQ ID NO 32 

<211> LENGTH: 19 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 



Jun. 20, 2002 
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<223> OTHER INFORMATION: D-peptide 
<221> NAME /KEY: VARIANT 
<2 2 2> LOCATION! (1)...(19) 

<223> OTHER INFORMATION: Xaa - Any Amino Acid 
<4 00> SEQUENCE: 32 

Lys Lys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cys 
15 10 15 

Xaa Xaa Xaa 



<210> SEQ ID NO 33 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<22 3> OTHER INFORMATION: D-peptide 
<221> NAME/KEY: VARIANT 
<222> LOCATION: { 1) ... (21 ) 

<223> OTHER INFORMATION : Xaa - Any Amino Acid 
<400> SEQUENCE i 3 3 

Lys Lys Lys Lys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp 
15 10 15 

Leu Cys Xaa Xaa Xaa 
20 



<210> SEQ ID NO 3 4 
<211> LENGTH: 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<22 3> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 3 4 

Gly Ala Cys Glu Ala Arg Hie Arg Glu Trp Ala Trp Leu Cye Ala Ala 
15 10 15 



<210> SEQ ID NO 35 
<211> LENGTH: 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<22 3> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 35 

Gly Ala Cye Asp Leu Lye Ala Lys Glu Trp Phe Trp Leu Cye Ala Ala 
15 10 15 



<210> SEQ ID NO 36 
<211> LENGTH : 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<4O0> SEQUENCE: 36 

Gly Ala Cye Ser Arg Ser Gin Pro Glu Trp Glu Trp Leu Cye Ala Ala 
15 10 15 



<210> SEQ ID NO 37 

<211> LENGTH: 16 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<c220> FEATURE: 



US 2002/0077284 Al Jun. 20, 2002 

32 



-continued 



<223> OTHER INFORMATION : D-peptide 
<400> SEQUENCE i 37 

Gly Ala Cys Leu Leu Arg Ala Pro Glu Trp Gly Trp Leu Cys Ala Ala 
15 10 15 



<210> SEQ ID NO 38 
<2U> LENGTH i 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 38 

Lys Lys Gly Ala Cys Glu Ala Arg Hie Arg Glu Trp Ala Trp Leu Cys 
15 10 15 

Ala Ala 



<210> SEQ ID NO 39 
<211> LENGTH: 18 
«c212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 39 

Lys Lye Gly Ala Cys Asp Leu Lys Ala Lys Glu Trp Phe Trp Leu Cys 
15 10 15 

Ala Ala 



<210> SEQ ID NO 4 0 
<:211> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 4 0 

Lys Lys Gly Ala Cys Ser Arg Ser Gin Pro Glu Trp Glu Trp Leu Cys 
15 10 15 

Ala Ala 



<210> SEQ ID NO 41 
<211> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<4 00> SEQUENCE: 41 

Lye Lys Gly Ala Cys Leu Leu Arg Ala Pro Glu Trp Gly Trp Leu Cys 
15 10 15 

Ala Ala 



<210> SEQ ID NO 42 
<211> LENGTH: 17 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : Invariant Residues in HIV-1, HIV-2 and SIV 
<221> NAME/KEY: VARIANT 
<222> LOCATION: (1)...(17) 

<223> OTHER INFORMATION : Xaa - Any Amino Acid 
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<400> SEQUENCE: 42 

Leu Leu Xaa Leu Thr Val Trp Gly Xaa Lyo Xaa Leu Gin Xaa Arg Xaa 
15 10 15 

Xaa 



<210> SEQ ID HO 4 3 
<211> LENGTH: 20 
<212> TYPE: PRT 

<2i3> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 4 3 

Lys Lys Lys Lys Gly Ala Cys Glu Ala Arg Hie Arg Glu Trp Ala Trp 
15 10 15 

Leu Cys Ala Ala 

20 



<210> SEQ ID NO 44 
<2U> LENGTH: 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<:400> SEQUENCE: 44 

Gly Ala Cya Gly Leu Gly Gin Glu Glu Trp Phe Trp Leu Cys Ala Ala 
15 10 15 



<210> SEQ ID NO 4 5 
<2U> LENGTH: 20 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 45 

Lye Lys Lyo Lys Gly Ala Cys Gly Leu Gly Gin Glu Glu Trp Phe Trp 
15 10 15 

Leu Cys Ala Ala 

20 



<210> SEQ ID NO 4 6 
<211> LENGTH: 20 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 4 6 

Lys Lys Lys Lys Gly Ala Cys Asp Leu Lys Ala Lys Glu Trp Phe Trp 
15 10 15 

Leu Cys Ala Ala 
20 



<210> SEQ ID NO 47 

<211> LENGTH: 15 

<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 



WEST 



US 2002/0077284 Al Jun. 20, 2002 

34 



-continued 



<4 00> SEQUENCE : 4 7 

Gly Ala Cys Glu Leu Leu Gly Trp Glu Trp Ala Trp Leu Cye Cye 

15 10 15 



<210> SEQ ID NO 4 8 
<2U> LENGTH; 20 
<212> TYPE: PRT 

<213> ORGAN ISM i Artificial Sequence 
<220> FEATURE l 

<223> OTHER INFORMATION: D-peptide 
<4 0D> SEQUENCE: 4 8 

Lyo Lys Lyo Lyo Gly Ala Cys Ser Arg Ser Gin Pro Glu Trp Glu Trp 
15 10 15 

Leu Cys Ala Ala 
20 



<210> SEQ ID NO 4 9 
<211> LENGTH: 20 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<4 00> SEQUENCE: 4 9 

Lys Lys Lys Lys Gly Ala Cys Leu Leu Arg Ala Pro Glu Trp Gly Trp 
15 10 15 

Leu Cys Ala Ala 
20 



<:210> SEQ ID NO 5 0 
<211> LENGTH: 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<40D> SEQUENCE: 5 0 

Gly Ala Cys Met Arg Gly Glu Trp Glu Trp Ser Trp Leu Cys Ala Ala 
15 10 15 



<210> SEQ ID NO 51 
<211> LENGTH: 20 
<212> TYPE t PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 51 

Lys Lys Lys Lye Gly Ala Cys Met Arg Gly Glu Trp Glu Trp Ser Trp 
15 10 15 

Leu Cys Ala Ala 
20 



<210> SEQ ID NO 52 

<211> LENGTH: 16 

<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 

<400> SEQUENCE: 52 



WEST 
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Gly Ala CyB Pro Pro Leu Asn Lye Glu Trp Ala Trp Leu Cys Ala Ala 

15 10 IS 



<210> SEQ ID NO 53 
<211> LENGTH : 20 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<4 00> SEQUENCE: 53 

Lye Lys Lys Lys Gly Ala Cys Pro Pro Leu Asn Lys Glu Trp Ala Trp 
15 10 15 

Leu Cys Ala Ala 
20 



<210> SEQ ID NO 54 
<211> LENGTH: 16 
<212> TYPE t PRT 

<213> ORGANISM i Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<:221> NAME /KEY: VARIANT 
<222> LOCATION: (!)...( 16) 

<223> OTHER INFORMATION: Xaa - Any Amino Acid 
<400> SEQUENCE: 5 4 

Gly Ala Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cys Ala Ala 
15 10 15 



<210> SEQ ID NO 55 
<211> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE s 

<223> OTHER INFORMATION t D-peptide 
<221> NAME /KEY : VARIANT 
<2 2 2> LOCATION: (1)...(18> 

<223> OTHER INFORMATION: Xao - Any Amino Acid 
<400> SEQUENCE: 55 

Lys Lys Gly Ala Cye Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cys 
15 10 15 

Ala Ala 



<210> SEQ ID NO 5 6 
<211> LENGTH: 20 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<221> NAME /KEY x VARIANT 
<2 2 2> LOCATION: (1)...(20> 

<223> OTHER INFORMATION: Xaa = Any Amino Acid 
<400> SEQUENCE: 5 6 

Lys Lye Lys Lys Gly Ala Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp 
15 10 15 

Leu Cys Ala Ala 
20 



<210> SEQ ID NO 57 
<211> LENGTH: 16 
<212> TYPE: PRT 
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<213> ORGANISM : Artificial Sequence 
<220> FEATURE : 

<22 3> OTHER INFORMATION! D-peptide 
<221> NAME /KEY: VARIANT 
<222> LOCATION t (1)...(L6) 

<22 3> OTHER INFORMATION: Xaa » Any Amino Acid 
<400> SEQUENCE: 57 

Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cys Xaa Xaa 
15 10 15 



<210> SEQ ID NO 58 
<211> LENGTH : 18 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<221> NAME /KEY: VARIANT 
<222> LOCATION : (1)...(18) 

<223> OTHER INFORMATION: Xaa - Any Amino Acid 
<400> SEQUENCE: 58 

Lys Lys Xoa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cye 
15 10 15 

Xaa Xaa 



<210> SEQ ID NO 5 9 
<211> LENGTH : 20 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<e223> OTHER INFORMATION: D-peptide 
<221> NAME /KEY: VARIANT 
<222> LOCATION : (1)...(20) 

<223> OTHER INFORMATION: Xaa = Any Amino Acid 
<400> SEQUENCE: 5 9 

Lys Lys Lys Lys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp 
15 10 15 



Leu Cya Xaa Xaa 

20 



<210> SEQ ID NO 6 0 
<2U> LENGTH : 17 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<221> NAME /KEY : VARIANT 
<222> LOCATION: < 1> ( 17 ) 

<223> OTHER INFORMATION: Xaa = Any Amino Acid 
<400> SEQUENCE: 60 

Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cys Xaa Xaa 
15 10 15 

Xaa 



<210> SEQ ID NO 61 
<211» LENGTH: 19 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<223> v OTHER INFORMATION: D-peptide 
<221> NAME/KEY: VARIANT 
<222> LOCATION: (1)...(L9) 
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<223> OTHER INFORMATION: Xaa - Any Amino Acid 
<4 00> SEQUENCE i 61 

Lys Lys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cy6 
15 10 15 

Xaa Xaa Xaa 



<210> SEQ ID NO 62 
<211> LENGTH : 21 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<223> OTHER INFORMATION: D-peptide 
<221> NAME/KEY: VARIANT 
<2 2 2> LOCATION: <1)...(21) 

<223> OTHER INFORMATION: Xaa = Any Amino Acid 
<4 00> SEQUENCE: 62 

Lye Lye Lys Lys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp 
15 10 15 

Leu Cye Xaa Xaa Xaa 
20 



<210> SEQ ID NO 6 3 

<211> LENGTH: 12 

<:212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE : 

<223> OTHER INFORMATION : Sequence Pattern in C-Terminal Residues in 

D-peptides 

<221> NAME /KEY: VARIANT 

<222> LOCATION: (!)...( 12) 

<223> OTHER INFORMATION: Xaa = Any Amino Acid 

<400> SEQUENCE: 6 3 

Cys Xaa Xaa Xaa Xaa Xaa Glu Trp Xaa Trp Leu Cys 
1 5 10 



<210> SEQ ID NO 6 4 
<211> LENGTH : 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE : 6 4 

Lys Lys Gly Ala Cys Gly Leu Gly Gin Glu Glu Trp Phe Trp Leu Cys 
15 10 15 

Ala Ala 



<210> SEQ ID NO 65 
<211> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<22 3> OTHER INFORMATION i D-peptide 
<400> SEQUENCE: 65 

Lys Lys Gly Ala Cys Glu Leu Leu Gly Trp Glu Trp Ala Trp Leu Cys 
15 10 15 



Ala Ala 
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<210> SEQ ID NO 66 
<21L> LENGTH; 20 
<212> TYPE ! PRT 

<213> ORGANISM i Artificial Sequence 
<220> FEATURE t 

<223> OTHER INFORMATION! D-peptide 
<400> SEQUENCE: 66 

Lys Lye Lys Lys Gly Ala Cys Glu Leu Leu Gly Trp Glu Trp Ala Trp 
15 10 15 

Leu Cys Ala Ala 
20 



<210> SEQ ID NO 67 
<211> LENGTH: 18 
<212s- TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : D-peptide 
<400> SEQUENCE I 67 

Lys Lys Gly Ala Cys Met Arg Gly Glu Trp Glu Trp Ser Trp Leu CyB 
15 10 15 

Ala Ala 



<210> SEQ ID NO 6 8 
<2U> LENGTH: 18 
<212> TYPE: PRT 

<213> ORGANISM! Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: D-peptide 
<400> SEQUENCE: 68 

Lys Lys Gly Ala Cys Pro Pro Leu Asn Lys Glu Trp Ala Trp Leu Cyo 
15 10 15 

Ala Ala 



Wtatt is claimed is: 

/ l.X peptide which comprises a soluble, trimeric form of 
a coiled-coil and a sufficient portion of the N-peptide region 
of HIV gp41 to comprise the amino acid residues which 
fofha the pocket of the N-helix coiled-coil of HIV gp41. 
fa.) The peptide of claim 1 wherein the peptide is a 
E^eptide. 

(Hy The D-peptide of claim 2 wherein the coiled coil is 
selected from the group consisting of: 

(a) the coiled coil of GCN4-pI Q I; 

(b) the coiled coil of GCN4-pII; 

(c) the coiled coil of Moloney Murine Leukemia Virus; 
and 

/t^J the coiled coil of ABC heterotrimer. 

fi.iThe D-peptide of claim 3 wherein the amino acid 
sentence of the coiled coil is: RMKQ I EDK1EEEIESKQ K- 
KlE^EIARIKK (SEQ ID NO: 25). 

Ps. The D-peptide of claim 2 wherein the sufficient portion 
cfMl/e N peptide region of HIV gp41 comprises the 
sequence: LLQLTVWGIKQLQARIL (SEQ ID NO: 20). 

(i^he D-peptide of claim 5 which is IQN17 (SEQ ID NO: 



ffl 7.\\ D-peptide which is a soluble, trimeric peptide model 
o>the HIV gp41 hydrophobic pocket, wherein the D-peptide 
comprises SEQ ID NO: 25 and a sequence which comprises 
17 amino acid residues, wherein the 17 amino acid residues 
comprise the sequence: LLXLTVWGXKXLQXRXX (SEQ 
ID NO: 42), wherein L, T, V, W, G, K, Q and R are amino 
acid residues represented by the single letter amino acid 
code^and X is any D -amino acid residue. 
f8JThe D-peptide of claim 7 wherein the sequence which 
comprises 17 amino acid residues is selected from the group 
consisting of: SEQ ID NO: 20; SEQ ID NO: 26; SEQ ID 

NO: 27 and SEQ ID.NO: 42. 

977 5 TD : peptide selected from the group consisting of: 

(a) CDLKAKEWFWLC (SEQ ID NO: 3); 

(b) CEARHREWAWLC (SEQ ID NO: 4); 

(c) CELLGWEWAWLC (SEQ ID NO: 5); 

(d) CLLRAPEWGWLC (SEQ ID NO: 6); 

(e) CSRSQPEWEWLC (SEQ ID NO: 7); 

(f) CGLGQEEWFWLC (SEQ ID NO: 8); 

(g) CMRGEWEWSWLC (SEQ ID NO: 9); 
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(h) CPPLNKEWAWLC (SEQ ID NO: 10); 

(i) CVLKAKEWFWLC (SEQ ID NO: 11); 

G) KKGACGLGQEEWFWLC (SEQ ID NO: 15); 

(k) KKGACELLGWEWAWLC (SEQ ID NO: 16); 

(1) KKKKG ACELLGWE WAW LC (SEQ ID NO: 17); 

(m) KKGACMRGEWEWSWLC (SEQ ID NO: 18); 

(n) KKGACPPLNKEWAWLC (SEQ ID NO: 19); 

(o) a D-peptide comprising WXWL (SEQ ID NO: 23); 

(p) a D-peptide comprising EWXWL (SEQ ID NO: 24); 

(q) a D-peptide comprising CXXXXXEWXWL (SEQ ID 
NO: 12) 

(r) ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 

(r) ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 

38) ; 

(t) ac-KKKKGACEARHREWAWLCAA-am (SEQ ID 
NO: 43); 

(u) ac-G ACGLG Q EE WFWLC AA-a m (SEQ ID NO: 44); 

(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 

15) ; 

(w) ac- KKKKG ACGLGQ EEWFWLCAA-am (SEQ ID 
NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKGACDLKAKEWFWLCAA-am (SEQ ID NO: 

39) ; 

(z) ac-KKKKG ACDLKAKEWFWLCAA- am (SEQ ID 
NO: 46); 

(a') ac-GACELLGWEWAWLCC-am (SEQ ID NO: 47); 

(b') ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 

16) ; 

(c') ac-KKKKG ACELLGWEWAWLCAA- am (SEQ ID 
NO: 17); 

(d') ac -G ACSRSQ PE WE WLC AA-a m (SEQ ID NO: 36); 

(e') ac-KKGACSRSQPEWEWLCAA-am (SEQ ID NO: 

40) ; 

(f) ac-KKKKGACSRSQPEWEWLCAA-am (SEQ ID 
NO: 48); 

(g) ac-GACLLRAPEWGWLCAA-am (SEQ ID NO: 37); 

(h*) ac-KKGACLLRAPEWGWLCAA-am (SEQ ID NO: 

41) ; 

(i') ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ID 
NO: 49); 

(j') ac-GACMRGEWEWSWLCAA-am (SEQ ID NO: 
50); 

(k') ac-KKGACMRGEWEWSWLCAA-am (SEQ ID 
NO: 18); 

(1') ac-KKKKG ACM R G E WE WS WLC AA- am (SEQ ID 
NO: 51); 

(m') ac-GACPPLNKEWAWLCAA-am (SEQ ID NO: 
52); 



(n') ac-KKGACPPLNKEWAWLCAA-am (SEQ ID NO: 
19); 

(o') ac-KKKKGACPPLNKEWAWLCAA-am (SEQ ID 
NO: 53); 

(p ! ) ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 
54); 

(q') ac-KKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 55); 

(r') ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 56); 

(s ( ) ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 

57) ; 

(t ( ) ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 

58) ; 

(u') ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID 
NO: 59); 

(v') ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 
60); 

(w') ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID 
NO: 61); 

(x ! ) ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ 
ID NO: 62); and 

(y') a variant of a sequence of (a) through (x')> wherein the 
variant binds the N-helix coiled-coil cavity of HIV 
gp41, wherein ac- at the C- terminus and -am at the 

^N-terminus are optional. 

fio. JThe peptide of claim 1 wherein the peptide is an 
L-)ejfide. 

^IJL/The L-peptide of claim 10 wherein the soluble, 
trimeric coiled-coil is selected from the group consisting of: 

(a) the coiled coil of GCN4-pI Q I; 

(b) the coiled coil of GCN4-pII; 

(c) the coiled coil of Moloney Murine Leukemia Virus; 
and 

(d) the coiled coil of ABC heterotrimer. 

(i2J The L-peptide of claim 10 wherein the sufficient 
portion of the N peptide region of HIV gp41 comprises the 
sequence: LLQLTVWGIKQLQARIL (SEQ ID NO: 20). 

(l3)The L-peptide of claim 12 which is IQN17 (SEQ ID 
N&S2). 

{14.\\n L-peptide which is a soluble, trimeric model of the 
HlV^gpl hydrophobic pocket, wherein the L-peptide com- 
prises SEQ ID NO: 25 and a sequence which comprises 17 
amino acid residues, wherein the 17 amino acid residues 
comprise the sequence: LLXLTVWGXKXLQXRXX, 
wherein L, T, V, W, G, K, Q and R are amino acid residues 
represented by the single letter amino acid code and X is any 
D -amino acid residue. 

^lyThe L-peptide of claim 14 wherein the sequence 
which comprises 17 amino acid residues is selected from the 
group consisting of: SEQ ID NO: 20; SEQ ID NO: 26; and 
SEQ ID NO: 27. 

16. A method of identifying andrug-tliat-interferes with 
formation of a complex between C34 peptide and N36 
peptide, comprising: 
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(a) combining a candidate drug to be assessed for its 
ability to interfere with formation of a complex 
between C34 peptide and N36 peptide, C34 peptide and 
N36 peptide, under conditions appropriate for formatin 
of a complex between C34 peptide and N36 peptide, 
thereby forming a test sample; and 

(b) determining whether formation of a complex between 
C34 peptide and N36 peptide is inhibited, 

wherein if formation of the complex is inhibited, the 
candidate drug is a drug that interferes with formation 
of the complex whereby a drug that interferes with 
formation of the complex is identified. 

17. The method of claim 16 wherein a control sample is 
formed by combining C34 peptide and N36 peptide, under 
the same conditions as the conditions under which the test 
sample is formed in (a); formation of a complex between 
C34 peptide and N36 peptide is determined and the extent to 
which the complex is formed in the test sample is compared 
with the extent to which the complex is formed in the control 
sample, wherein if the complex is formed to a lesser extent 
in the test sample than in the control sample, the candidate 
drug is a drug that interferes with formation of the complex, 
whereby a drug that interferes with formation of the com- 
plex is identified. 

18. The method of claim 16 wherein C34 peptide and N36 
peptide are each labeled by a member of a pair of donor- 
acceptor molecules and the extent to which formation of a 
complex between C34 and N36 occurs is assessed by 
determining the extent to which light emission occurs from 
the acceptor molecule, wherein if light emission occurs to a 
lesser extent in the presence of the candidate drug than in the 
absence of the candidate drug, the candidate drug is a drug 
that interferes with formation of a complex between C34 
peptide and N36 peptide. 

19. The method of claim 17 wherein C34 peptide and N36 
peptide are each labeled by a member of a pair or donor- 
acceptor molecules and the extent to which light emission 
occurs is assessed in the test sample and in the control 
sample, wherein if light emission is less in the test sample 
than in the control sample, the candidate drug is a drug 
which inhibits formation of a complex between C34 prptide 
and N36 peptide. 

20. The method of claim 16 further comprising assessing 
whether the drug that interferes with formation of the 
complex is an inhibitor of HIV entry into cells by assessing 
the effect of the drug on cell/cell fusion or HTV infection of 
cells is less in the presence of the drug than in its absence, 
the drug is an inhibitor of HIV entry into cells. 

21. A method of eliciting an immune response in an 
individual, comprising introducing into the individual a 
peptide comprising a trimeric form of a coiled-coil region of 
a protein and a sufficient portion of the N-peptide region of 
HIV gp41 to comprise the amino acid residues which form 
part or all of the N-helix coiled-coil of HIV gp41 and the 
peptide is present in a pharmaceutically acceptable carrier. 

22. The method of claim 21 wherein the peptide is 
introduced into the individual by a route of administration 
selected from the group consisting of: intramuscularly, intra- 
peritoneally, orally, nasally and transdermally. 

23. The method of claim 21 wherein the coiled-coil is 
selected from the group consisting of: GCN4-pI Q I; GCN4- 
pll; Moloney Murine Leukemia Virus and ABC heterotri- 
mer. 



24. The method of claim 21 wherein the peptide is IQN17. 

25. A method of interfering with entry of HIV into a 
mucosal cell comprising administering or applying to a 
mucosal surface a composition comprising: (1) a drug which 
binds HIV envelope protein gp41 subunit and interferes with 
entry of HIV into cells of the mucosal surface and (2) a 
carrier or base. 

26. The method of claim 25 wherein the drug binds the 
cavity on the surface of the N-helix coiled-coil of HIV 
envelope protein gp41 subunit, 

27. The method of claim 26 wherein the drug prevents or 
reduces the gp41 conformational change, thereby interfering 
with entry of HIV into cells of the mucosal surface. 

28. The method of claim 25 wherein the composition 
comprises a component selected from the group consisting 
of: 

(a) C34 peptide; 

(b) DP178; 

(c) DP649; 

(d) T1249; 

(e) a derivative of (a)-(d); 

(f) a D-peptide which binds to the hydrophobic pocket of 
HIV gp41; 

(g) a derivative of (f); 

(h) a combination of two or more of (a)-(g); and 

(i) a molecule that inhibits HIV infectivity by binding to 
the N-helix coiled coil. 

29. The method of claim 28 wherein the carrier or base is 
selected from the group consisting of: a foam, a gel, other 
substance sufficiently viscous to retain the drug, water and a 
buffer. 

30. The method of claim 28 wherein the carrier or base is 
a vaginal suppository or rectal suppository. 

31. The method of claim 28 wherein the drug is released 
from the carrier or base immediately or soon after it is 
administered or applied to the vagina, mouth or rectum. 

32. The method of claim 28 wherein the drug is released 
from the carrier or base gradually or after a specified period 
after it is administered or applied to the vagina, mouth or 
rectum. 

33. The method of claim 28 wherein the drug is on the 
surface of or incorporated within a contraceptive device in 
a manner which permits release of the drug under conditions 
of use. 

34. The method of claim 28 wherein the D-peptide of (e) 
comprises an amino acid sequence selected from the group 
consisting of: 

(a) CDLKAKEWFWLC (SEQ ID NO: 3); 

(b) CEARHREWAWLC (SEQ ID NO: 4); 

(c) CELLGWEWAWLC (SEQ ID NO: 5); 

(d) CLLRAPEWGWLC (SEQ ID NO: 6); 

(e) CSRSQPEWEWLC (SEQ ID NO: 7); 

(f) CGLGQEEWFWLC (SEQ ID NO: 8); 

(g) CMRGEWEWSWLC (SEQ ID NO: 9); 

(h) CPPLNKEWAWLC (SEQ ID NO: 10); 
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(i) CVLKAKEWFWLC (SEQ ID NO: 11); 

(j) KKGACGLGQEEWFWLC (SEQ ID NO: 15); 

(k) KKGACELLGWEWAWLC (SEQ ID NO: 16); 

(1) KKKKG ACELLG WEWAW LC (SEQ ID NO: 17); 

(m) KKGACMRGEWEWSWLC (SEQ ID NO: 18); 

(n) KKGACPPLNKEWAWLC (SEQ ID NO: 19); 

(o) a D-peptide comprising WXWL (SEQ ID NO: 23); 

(p) a D-peptide comprising EWXWL (SEQ ID NO: 24); 

(q) a D-peptide comprising CXXXXXEWXWL (SEQ ID 
NO: 12) 

(r) ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 

(r) ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 

38) ; 

(t) ac-KKKKGACEARHREWAWLCAA-am (SEQ ID 
NO: 43); 

(u) ac-GACGLGQEEWFWLCAA-am (SEQ ID NO: 44); 

(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 

15) ; 

(w) ac- KKKKG ACGLG QEEWFWLCAA-am (SEQ ID 
NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKG ACD LKAKEWFWLC AA- a m (SEQ ID NO: 

39) ; 

(z) ac-KKKKGACD LKAKEWFWLC AA- am (SEQ ID 
NO: 46), 

(a') ac-GACELLGWEWAWLCC-am (SEQ ID NO: 47); 

(b') ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 

16) ; 

(c') ac-KKKKGACELLGWEWAWLC AA-am (SEQ ID 
NO: 17); 

(d) ac-GACSRSQPEWEWLCAA-am (SEQ ID NO: 36); 

(e<) ac-KKG ACSRSQPEWEWLCAA-am (SEQ ID NO: 

40) ; 

(f) ac- KKKKG ACSRSQPEWEWLCAA-am (SEQ ID 
NO: 48); 

(g') ac-GACLLRAPEWGWLCAA-am (SEQ ID NO: 37); 

(h 1 ) ac-KKGACLLRAPEWGWLCAA-am (SEQ ID NO: 

41) ; 

(O ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ID 
NO: 49); 

G') ac-GACMRGEWEWSWLCAA-am (SEQ ID NO: 
50); 

(k') ac-KKGACMRGEWEWSWLCAA-am (SEQ ID 
NO: 18); 

(1') ac-KKKKGACMRGEWEWSWLCAA-am (SEQ ID 
NO: 51); 

(m') ac-GACPPLNKEWAWLCAA-am (SEQ ID NO: 
52); 



(n') ac-KKGACPPLNKEWAWLCAA-am (SEQ ID NO: 
19); 

(o') ac-KKKKGACPPLNKEWAWLCAA-am (SEQ ID 
NO: 53); 

(p') ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 
54); 

(q') ac-KKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 55); 

(r') ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 56); 

(s') ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 

57) ; 

(f) ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 

58) ; 

(u f ) ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID 
NO: 59); 

(V) ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 
60); 

(w') ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID 
NO: 61); 

(x') ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ 
ID NO: 62); and 

(y') a variant of a sequence of (a) through (x% wherein the 
variant binds the N-helix coiled-coil cavity of HIV 
gp41, wherein ac- at the C-terminus and -am at the 
N-terminus are optional. 

35. A method of identifying a compound or molecule 
which binds the N-helix coiled-coil cavity of HIV-1 gp41 
envelope protein, wherein the compound or molecule to be 
assessed is referred to as a candidate inhibitor, comprising: 

(a) combining a D-peptide which binds the N-helix 
coiled-coil cavity, a fusion protein which is a soluble 
model which presents the N-helix coiled-coil cavity 
and a candidate inhibitor, under conditions appropriate 
for binding of the D-peptide to the N-hclix coiled-coil 
cavity, thereby producing a test sample; 

(b) determining the extent to which binding of the D-pep- 
tide to the N-helix coiled-coil cavity in the test sample; 
and 

(c) comparing the extent of binding determined in to the 
N-helix coiled-coil cavity in a control sample, wherein 
the control sample is the same as the test sample except 
that the control sample does not include the candidate 
inhibitor and is maintained under the same conditions 
appropriate for binding of the D-peptide to the N-helix 
coiled-coil cavity as is the test sample, 

wherein if the extent of binding in the test sample is less 
than the extent of binding in the control sample, the 
candidate inhibitor is a compound or molecule which 
binds the N-helix coiled-coil cavity of HIV-1 gp41 
envelope protein. 

36. The method of claim 35 wherein the fusion protein is 
IQN17. 

37. The method of claim 35 wherein the D-peptide is 
labeled with a fluorescent reporter and the fusion protein is 
labeled with a quencher which, when in sufficiently close 
proximity to the fluorescent reporter, quenches the signal 
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from the reporter and detection of a signal from the fluo- 
rescent reporter indicates that the candidate inhibitor is a 
compound or molecule which binds the N-helix coiled-coil 
c avj l^jg^HIV^Lgp41-CB velope-pret&kh 

/p8\\TKIsion protein comprising a trimeric form of the 
coited-coil region of GCN4 and a portion of the N-peptide 
region of HIV-1 gp41, wherein the portion of the N-peptide 
region of gp41 comprises part or all or none of the N-helix 
coitfcd-coil pocket of HIV-1 gp41. 

'QiyA fusion protein of claim 38 wherein the portion of the 
N-peptide region of HIV gp41 comprises the following 24 
amino acid residues of HIV: SGIVQQQNNLL- 
RAIEAQQHLLQLT. 

40. A method of eliciting an immune response in an 
individual, comprising introducing into the individual a 
fusion protein comprising a soluble, trimeric form of a 
coiled-coil and a sufficient portion of the N-peptide region of 
HIV-1 gp41, to comprise the amino acid residues which 
form the pocket of the N-helix coiled-coil of HIV-1 gp41, 
wherein the fusion protein is present in a pharmaceutically 
acceptable carrier. 

41. AD-peptide which comprises at least four amino acid 
residues and comprises the consensus sequence WXWL, 
wherein W represents D-tryptophan, L represents D-leucine 
and X represents any moiety. 

42. The D-peptide of laim 41 wherein X is a D-amino acid 
residue or a modified D-amino acid residue. 

43. The D-peptide of claim 41, wherein the D-peptide 
comprises 2 to 21 amino acid residues. 

44. A D-peptide which comprises at least five amino acid 
residues, wherein the at least five amino acid residues are 
EWXWL, wherein E represents D-glutamic acid, W repre- 
sents D-tryptophan, L represents D-leucine and X represents 
an amino acid residue, a modified amino acid residue or a 
moiety other than an amino acid residue. 

45. AD-peptide which comprises an amino acid sequence 
selected from the group consisting of: 

(a) CDLKAKEWFWLC (SEQ ID NO: 3); 

(b) CEARHREWAWLC (SEQ ID NO: 4); 

(c) CELLGWEWAWLC (SEQ ID NO: 5); 

(d) CLLRAPEWGWLC (SEQ ID NO: 6); 

(e) CSRSQPEWEWLC (SEQ ID NO: 7); 

(f) CGLGQEEWFWLC (SEQ ID NO: 8); 

(g) CMRGEWEWSWLC (SEQ ID NO: 9); 

(h) CPPLNKEWAWLC (SEQ ID NO: 10); 

(i) CVLKAKEWFWLC (SEQ ID NO: 11); 

(j) KKGACGLGQEEWFWLC (SEQ ID NO: 15); 

(k) KKGACELLGWEWAWLC (SEQ ID NO: 16); 

(1) KKKKGACELLGWEWAWLC (SEQ ID NO: 17); 

(m) KKGACMRGEWEWSWLC (SEQ ID NO: 18); 

(n) KKGACPPLNKEWAWLC (SEQ ID NO: 19); 

(o) a D-peptide comprising WXWL (SEQ ID NO: 23); 

(p) a D-peptide comprising EWXWL (SEQ ID NO: 24); 

(q) a D-peptide comprising CXXXXXEWXWL (SEQ ID 
NO: 12) 



(r) ac-GACEARHREWAWLCAA-am (SEQ IL) NO: 34); 

(r) ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 

38) ; 

(t) ac-KKKKGACEARHREWAWLCAA-am (SEQ ID 
NO: 43); 

(u) ac-G ACGLG Q EE WFWLCAA-am (SEQ ID NO: 44); 

(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 

15) ; 

(w) ac-KKKKGACGLGQEEWFWLCAA-am (SEQ ID 
NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKGACDLKAKEWFWLCAA-am (SEQ ID NO: 

39) ; 

(z) ac-KKKKGACDLKAKEWFWLCAA-am (SEQ ID 
NO: 46); 

(a') ac-GACELLGWEWAWLCC-am (SEQ ID NO: 47); 

(b 1 ) ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 

16) ; 

(c') ac-KKKKG ACELLGWE WAWLCAA- am (SEQ ID 
NO: 17); 

(d') ac-GACSRSQPEWEWLCAA-am (SEQ ID NO: 36); 

(e') ac-KKGACSRSQPEWEWLCAA-am (SEQ ID NO: 

40) ; 

(F) ac-KKKKGACSRSQPEWEWLCAA-am (SEQ ID 
NO: 48); 

(g') ac -G ACLLRAPEWG WLC AA- am (SEQ ID NO: 37); 

(h') ac-KKGACLLRAPEWGWLCAA-am (SEQ ID NO: 

41) ; 

0') ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ID 
NO: 49); 

(j') ac-GACMRGEWEWSWLCAA-am (SEQ ID NO: 
50); 

00 ac-KKGACMRGEWEWSWLCAA-am (SEQ ID 
NO: 18); 

(1*) ac-KKKKGACMRGEWEWSWLCAA-am (SEQ ID 
NO: 51); 

(m r ) ac-GACPPLNKEWAWLCAA-am (SEQ ID NO: 
52); 

(n') ac-KKGACPPLNKEWAWLCAA-am (SEQ ID NO: 
19); 

(o') ac-KKK KG ACPPLNKE WAWLCAA- am (SEQ ID 
NO: 53); 

(p') ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 
54); 

(q-) ac-KKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 55); 

(0 ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 56); 

(s f ) ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 
57); 
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(0 ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 
58); 

(u') ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID 
NO: 59); 

(v') ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 
60); 

(v/) ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID 
NO: 61); 

(x') ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ 
ID NO: 62); and 

(y 1 ) a variant of a sequence of (a) through (x'), wherein the 
variant binds the N-helix coiled-coil cavity of HIV 
gp41, wherein ac- at the C-terminus and -am at the 
N-terminus are optional. 

46. A method of identifying a drug that binds the N-helix 
coiled-coil cavity of HIV gp41 comprising: 

(a) combining: (1) a candidate drug to be assessed for its 
ability to bind the N-helix coiled-coil cavity of HIV 
gp41 and; (2) a fusion protein which comprises a 
trimeric version of the coiled-coil region of a protein 
and a sufficient portion of the N-peptide of HIV gp41 
to include the HIV gp41 cavity, under conditions appro- 
priate for presentation of the HIV gp41 cavity for 
binding by a drug; and 

(b) determining whether the candidate drug binds the HIV 
gp41 cavity, wherein if binding occurs, the candidate 
drug is a drug which binds the N-helix coiled-coil 
cavity of HIV gp41. 

47. The method of claim 46 wherein in (a), a peptide 
which binds the N-helix coiled-coil cavity of HIV gp41 is 
combined with the candidate drug and the fusion protein and 
in (b), whether the candidate drug binds the HIV gp41 cavity 
is determined in the presence of the peptide which binds the 
N-helix coiled-coil cavity of HIV gp41. 

48. The method of claim 42 wherein the peptide which 
binds the N-helix coiled-coil cavity of HIV gp41 is selected 
from the group consisting of: 

(a) CDLKAKEWFWLC (SEQ ID NO: 3); 

(b) CEARHREWAWLC (SEQ ID NO: 4); 

(c) CELLGWEWAWLC (SEQ ID NO: 5); 

(d) CLLRAPEWGWLC (SEQ ID NO: 6); 

(e) CSRSQPEWEWLC (SEQ ID NO: 7); 

(f) CGLGQEEWFWLC (SEQ ID NO: 8); 

(g) CMRGEWEWSWLC (SEQ ID NO: 9); 

(h) CPPLNKEWAWLC (SEQ ID NO: 10); 

(i) CVLKAKEWFWLC (SEQ ID NO: 11); 

(j) KKGACGLGQEEWFWLC (SEQ ID NO: 15); 
(k) KKGACELLGWEWAWLC (SEQ ID NO: 16); 
(1) KKKKGACELLGWEWAWLC (SEQ ID NO: 17); 
(m) KKGACMR GE WEWSWLC (SEQ ID NO: 18); 
(n) KKGACPPLNKEWAWLC (SEQ ID NO: 19); 
(o) a D-peptide comprising WXWL (SEQ ID NO: 23); 
(p) a D-peptide comprising EWXWL (SEQ ID NO: 24); 



(q) a D-peptide comprising CXXXXXE WXWL (SEQ ID 
NO: 12) 

(r) ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 

(r) ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 

38) ; 

(t) ac-KKKKGACEARHREWAWLCAA-am (SEQ ID 
NO: 43); 

(u) ac-GACGLGQEEWFWLCAA-am (SEQ ID NO: 44); 

(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 

15) ; 

(w) ac-KKKKGACGLGQEEWFWLCAA-am (SEQ ID 
NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKGACDLKAKEWFWLCAA-am (SEQ ID NO: 

39) ; 

(z) ac-KKKKGACDLKAKEWFWLCAA-am (SEQ ID 
NO: 46); 

(a 1 ) ae- GACE LLG W E WAWLCC- am (SEQ ID NO: 47); 

(b) ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 

16) ; 

(c') a c-KKKKG ACELLG WE WAWLC AA- am (SEQ ID 
NO: 17); 

(d') ac-GACSRSQPEWEWLCAA-am (SEQ ID NO: 36); 

(e 1 ) ac-KKGACSRSQPEWEWLCAA-am (SEQ ID NO: 

40) ; 

(f) ac-KKKKGACSRSQPEWEWLCAA-am (SEQ ID 
NO: 48); 

(g') ac-GACLLRAPEWGWLCAA-am (SEQ ID NO: 37); 

(h*) ac-KKGACLLRAPEWGWLCAA-am (SEQ ID NO: 

41) ; 

(i f ) ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ID 
NO: 49); 

(j') ac-G ACM RGEWEWS WLCAA-am (SEQ ID NO: 
50); 

(k') ac-KKGACMRGEWEWSWLCAA-am (SEQ ID 
NO: 18); 

(1') ac-KKKKGACMRGEWEWSWLCAA-am (SEQ ID 
NO: 51); 

(nV) ae-GACPPLNKEWAWLCAA-am (SEQ ID NO: 
52); 

(n') ac-KKGACPPLNKEWAWLCAA-am (SEQ ID NO: 
19); 

(o') ac-KKKKGACPPLNKEWAWLCAA-am (SEQ ID 
NO: 53); 

(p') ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 
54); 

(q 1 ) ac-KKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 55); 

(r') ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID 
NO: 56); 
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(s') ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 

57) ; 

(t 1 ) ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 

58) ; 

(u') ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID 
NO: 59); 

(v') ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 
60); 

(w*) ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID 
NO: 61); 

(x') ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ 
ID NO: 62); and 

(y 1 ) a variant of a sequence of (a) through (x ( ), wherein the 
variant binds the N-helix coiled-coil cavity of HIV 
gp41, wherein ao at the C-terminus and -am at the 
N-terminus are optional. 

49. The method of claim 46 wherein the candidate drug is 
detectably labeled and binding of the candidate drug to the 
HIV gp41 cavity is determined by detecting the presence of 
the detectable label on the HIV gp41 cavity. 

50. The method of claim 46 wherein the fusion protein 
comprises a soluble, trimeric version of the coiled-coil 
region of GCN4 and a sufficient portion of the N-peptide of 
HIV gp41 to include the HIV gp41 cavity. 

51. The method of claim 50 wherein the fusion protein is 
IQN17 or a variant thereof, wherein the amino acid sequence 
of IQN17 is SEQ ID NO.: 2. 

52. A method of identifying a drug that binds the N-helix 
coiled-coil cavity of HIV gp41 comprising: 

(a) combining: (1) a soluble model that presents the 
N-helix coiled-coil cavity of HIV gp41 in such a 
manner that it is available for binding by a drug and (2) 
a candidate drug, which is to be assessed for its ability 
to bind the N-helix coiled-coil cavity; and 

(b) determining whether the candidate drug binds the 
N-helix coiled coil cavity of the soluble model, 

wherein if binding occurs, the candidate drug is a drug 
which binds the N-helix coiled-coil cavity of HIV 
gp41. 

53. A method of producing a drug that binds the N-helix 
coiled-coil cavity of HIV gp41 and inhibits HIV entry into 
cells, comprising 

(a) combining (1) a candidate drug to be assessed for its 
ability to bind the N-helix coiled-coil cavity of HIV 
gp41 and inhibit HIV entry into cells and (2) a fusion 
protein which comprises a trimeric version of the 
coiled-coil region of a protein and a sufficient portion 
of the N-peptide of HIV gp41 to include the HIV gp41 
cavity, under conditions appropriate for presentation of 
the HIV gp41 cavity for binding by a drug; 

(b) determining whether the candidate drug binds the HIV 
gp41 cavity, wherein if binding of the candidate drug to 
the N-helix coiled-coil cavity of HIV gp41, occurs, the 
candidate drug is a drug which binds the N-helix 
coiled-coil cavity of HIV gp41, whereby a drug which 
binds the N-helix coiled-coil cavity of HIV gp41 is 
produced; and 



(c) assessing the ability of the drug produced in (b) to 
inhibit HIV entry into cells, wherein if the drug inhibits 
HIV entry into cells, it is a drug which binds the 
N-helix coiled-coil cavity of HIV gp41 and inhibits 
HIV entry into cells. 

54. The method of claim 53 wherein the fusion protein of 
(aX2) comprises a soluble, trimeric coiled-coil region of 
GCN4 and a sufficient portion of the N-peptide of HIV gp41 
to include the HIV gp41 cavity and the ability of the drug 
produced in (b) to inhibit HIV entry into cells is assessed in 
a syncytium assay, an infection assay or both. 

55. The method of claim 54 wherein the drug identified in 
(c) is further assessed for its ability to inhibit HIV entry into 
cells by in vivo assessment in an appropriate animal model. 

56. The method of claim 54 wherein the fusion protein is 
IQN17 or a variant thereof, wherein the amino acid sequence 
of IQN17isSEQ ID NO.:2. 

57. A method of producing a soluble model of the N-helix 
coiled-coil cavity of HIV gp41, comprising producing a 
fusion protein comprising: (a) a soluble, trimeric form of a 
coiled-coil and (b) a sufficient portion of the N-peptide 
region of HIV gp41 to comprise the amino acid residues 
which form the pocket of the N-helix coiled-coil of HIV 
gp41. 

58. The method of claim 57 wherein the protein of (a) is 
GCN4-pI Q I, GCN4-pII, Moloney Murine Leukemia Virus 
or ABC heterotrimer and the sufficient portion of (b) is 
selected from the group consisting of a portion comprising 
SEQ ID NO: 20; a portion comprising SEQ ID NO: 26; a 
portion comprising SEQ ID NO: 27 and a portion compris- 
ing SEQ ID NO: 42. 

59. The method of claim 57 wherein the fusion protein is 
IQN17 or a variant thereof, wherein the amino acid sequence 
of IQN17 is SEQ ID NO: 2. 

60. A method of producing a drug that binds the N-helix 
coiled-coil cavity of HIV gp41 comprising, 

(a) producing or obtaining a soluble model of the N-helix 
coiled-coil cavity of HIV gp41; 

(b) combining: (1) a candidate drug to be assessed for its 
ability to bind the N-helix coiled-coil cavity of HIV 
gp41 and (2) the soluble model of the N-helix coiled- 
coil cavity of HIV gp41; and 

(c) determining whether the candidate drug binds the 
N-helix coiled-coil cavity of HIV gp41, 

wherein if the candidate drug binds the N-helix coiled- 
coil cavity of HIV gp41, the candidate drug is a drug 
which binds the N-helix coiled-coil cavity of HIV 
gp41, whereby a drug which binds the N-helix coiled- 
coil cavity of HIV gp41 is produced. 

61. The method of claim 60 wherein the soluble model is 
a fusion protein which comprises a trimeric version of the 
coiled-coil region of a protein and a sufficient portion of the 
N-peptide of HIV gp41 to include the HIV gp41 cavity. 

62. The method of claim 61 wherein the fusion protein is 
IQN17 or a variant thereof, wherein the amino acid sequence 
of IQN17 is SEQ ID NO.:2. 

63. A method of producing a drug that binds the N-helix 
coiled-coil cavity of HIV gp41 and inhibits its entry into 
cells, comprising; 

(a) producing or obtaining a soluble model of the N-helix 
coiled-coil cavity of HIV gp41; 
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(b) combining: (1) a candidate drug to be assessed for its 
ability to bind the N-helix coiled-coil cavity of HIV 
gp41 and (2) the soluble model of the N-helix coiled- 
coil cavity of HIV gp41; 

(c) determining whether the candidate drug binds the 
N-helix coiled-coil cavity of HIV gp41, wherein if the 
candidate drug binds the N-helix coiled-coil cavity of 
HTV gp41, the candidate drug is a drug which binds the 
N-helix coiled-coil cavity of HIV gp41, whereby a drug 
which binds the N-helix coiled-coil cavity of HIVgp41 
is produced and; 

(d) assessing the ability of the drug produced in (c) to 
inhibit HIV entry into cells, 

wherein if the drug inhibits HIV entry into cells, it is a 
drug which binds the N-helix coiled-coil cavity of HIV 
gp41 and inhibits HIV entry into cells. 

64. The method of claim 63 wherein the soluble model is 
a fusion protein which comprises a trimeric version of the 
coiled-coil region of a protein and a sufficient portion of the 
N-peptide of HIV gp41 to include the HIV gp41 cavity. 

65. The method of claim 64 wherein the fusion protein is 
IQN17 or a variant thereof, wherein the amino acid sequence 
of IQN17 is SEQ ID No.;2. 

66. A drug produced by the method of claim 60. 

67. A drug produced by the method of claim 61. 

68. A drug produced by the method of claim 62. 

69. A drug produced by the method of claim 63. 

70. A drug produced by the method of claim 64. 

71. A drug produced by the method of claim 65. 

72. A method of identifying a peptide that binds to the 
N-helix coiled-coil cavity of HIV gp41, comprising: 

(a) combining IQN17 in the D-handedness with a phage 
display library of b-amino acid peptides, under condi- 
tions appropriate for binding of members of the library 
to IQN17 in the D-handedness; and 

(b) determining if binding occurs between IQN17 in the 
D-handedness and a member or members of the phage 
display library, wherein if binding occurs, a peptide that 
binds to the N-helix coiled-coil cavity of HIV gp41 in 
the D-handedness is identified. 

73. The method of claim 72 further comprising determin- 
ing the amino acid sequence of the member or members of 
the phage display library which bind to IQN17 in the 
D-handedness and producing peptides, in D form, compris- 
ing the amino acid sequences determined, wherein the 
peptides in D form bind the N-helix coiled-coil cavity in the 
natural L- handedness. 

74. A compound of Formula I, 




(F), 



wherein 

A is a D -amino acid residue or an N-substituted glycyl 
residue of the formula 




wherein one of R A1 and is a substituted or unsub- 
stituted aryl, heteroaryl, arylmethyl, heteroarylmethyl, 
benzo -fused aryl, benzo-fused heteroaryl, benzo-fused 
arylmethyl, benzo-fused heteroarylmethyl, cycloalkyl 
or bicycloalkyl; and the other is hydrogen; and W is 
hydrogen, methyl, trifluoromethyl or halogen, for 
example, fluorine, chlorine, bromine or iodine; 

B is a glycyl residue or a D-amino acid or N-substituted 
glycyl residue of the formula 



R B i x o 

I l li 

N — C — C 

I 

RB2 



wherein one of R B1 and R B2 is a substituted or unsub- 
stituted linear, branched or cyclic alkyl, aryl, arylalkyl, 
heteroaryl or heteroarylalkyl group; and the other is 
hydrogen; and X is hydrogen, methyl, trifluoromethyl 
or halogen, such as fluorine, chlorine, bromine or 
iodine; 

D is a D-amino acid residue or N-substituted glycyl 
residue of the formula 



Rdi y o 

i I n 

-N C C 

R D2 



wherein one of R DJ and R D2 is a substituted or unsub- 
stituted aryl, heteroaryl, arylmethyl, heteroarylmethyl, 
benzo-fused aryl, benzo-fused heteroaryl, benzo-fused 
arylmethyl; benzo-fused heteroarylmethyl, cycloalkyl 
or bicycloalkyl; and the other is hydrogen; andY is 
hydrogen, methyl, trifluoromethyl or halogen, such as 
fluorine, chlorine, bromine or iodine; 

E is a D-amino acid residue or N-substituted glycyl 
residue of the formula 



M 1 

■N — C — C 

I 

Rfi2 
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wherein one of R EJ and R E2 is a substituted or unsub- 
stituted, linear, branched or cyclic alkyl, aryl or aryla- 
lkyl group; and the other is hydrogen; and Z is hydro- 
gen, methyl, trifluoromethyl or halogen, such as 
fluorine, chlorine, bromine or iodine; 

K, L, M and N are each, independently, an amino acid 
residue or a polypeptide group comprising 2 to about 8 
amino acid residues; 

F is a direct bond or a difunctional linking group; and 

n, p, q, r and s are each, independently, 0 or 1. 

75. The compound of claim 74 wherein one of R A3 and 
R and one of R D1 and R D2 are, independently, a phenyl, 
substituted phenyl, naphthyl, substituted naphthyl, naphth- 
ylmethyl, substituted naphthylmethyl, benzyl or substituted 
benzyl group, or a group of the formula 




where J is O, S or NR, where R is H or linear, branched 
or cyclic C a -C 6 -alkyl; and 

R lf R 2 , R 3> R 4 and R s are independently selected from the 
group consisting of 

hydrogen, halogen and alkyl. 

76. The compound of claim 75 wherein R A1 and R D1 are 
both hydrogen. 

77. The compound of claim 74 wherein one of R B3 and 
R bi is hydrogen, substituted or unsubstituted linear, branched 
or cyclic Cj-C^-alkyl, phenyl, benzyl, naphthyl or naphth- 
ylmethyl. 

78. The compound of claim 77 wherein R B1 is hydrogen. 

79. The compound of claim 74 wherein one of R m and 
R is a substituted or unsubstituted, linear, branched or 
cyclic Cj-Q-alkyl group or a substituted or unsubstituted 
phenyl or naphthyl group and the other is hydrogen. 

80. The compound of claim 79 wherein R E3 is hydrogen. 

81. The compound of claim 74 wherein A and D are each 
a D-tryptophan residue and E is a D-leucine residue. 

82. The compound of claim 74 wherein K is a D-amino 
acid residue or an N-substituted glycyl residue comprising 
an amino-, carboxyl- or sulfhydryl substituted side chain and 
L is a polypeptide comprising 2 or 3 D-amino acid residues 
or N-substituted glycine residues. 

83. The compound of claim 74 wherein M is a polypeptide 
group comprising from 2 to about 8 D-amino acid residues, 
of which at least one comprises an amino-, carboxy- or 
sulfhydryl substituted side chain, and N is a polypeptide 
group comprising from 1 to about 6 amino acid residues, of 
which at least one is a lysine residue. 

84. The compound of claim 74 wherein F is a divalent 
linking group having a length from about 2 to about 40 
atoms. 



85. The compound of claim 84 wherein F is a polypeptide 
linking group of the formula — P n — , wherein n is an integer 
from 1 to about 12 and each P is independently an L- or 
D-amino acid or N-substituted glycyl residue, a glycyl 
residue or an N-substituted glycyl residue. 

86. The compound of claim 84 wherein F is a substituted 
or unsubstituted C 4 -C 40 -alkylene group or a C^Qyj-alky- 
lene group which is interrupted at one or more points by a 
heteroatom, a phenylene group or a heteroarylene group. 

87. The compound of claim 84 wherein F is a polysac- 
charide group comprising from 1 to about 10 glycoside 
groups. 

88. A method of producing a drug which fits the N-helix 
coiled-coil pocket of HIV gp41, comprising: 

(a) obtaining a crystal of a soluble, trimeric peptide model 
of the HIV gp41 hydrophobic pocket; 

(b) obtaining the atomic coordinates of the peptide model 
by X-ray diffraction studies using the crystal obtained 
in (a); 

(c) using the atomic coordinates obtained in (b) to define 
the N-helix coiled-coil pocket of HIV gp41; 

(d) identifying a molecule or compound which fits the 
N-helix coiled-coil pocket of HTV gp41; 

(e) obtaining the molecule or compound identified in (d); 
and 

(f) contacting the molecule or compound obtained in (e) 
with the N-helix coiled-coil pocket of HIV gp41 to 
assess the ability of the molecule or compound to fit the 
pocket of HIV gp41, 

wherein if the molecule or compound fits the N-helix 
coiled-coil pocket of HIV gp41, the molecule or com- 
pound is a drug which fits the pocket, whereby a drug 
which fits the N-helix coiled-coil pocket of HIV gp41 
is produced. 

89. The method of claim 88 wherein the soluble, trimeric 
peptide molecule comprises a soluble, trimeric form of a 
coiled coil and a sufficient portion of the N-peptide region of 
HIV gp41 to comprise the amino acid residues which form 
the pocket of the N-helix coiled-coil of HIV gp41. 

90. The method of claim 89 wherein in (f), the molecule 
or compound is contacted with the N-helix coiled-coil 
pocket of HIV gp41 by contacting the molecule or com- 
pound with IQN17, the N-helix of HIV gp41 or a polypep- 
tide which comprises the HIV pocket. 

91. The method of claim 89 wherein the soluble model is 
IQN17. 

92. The method of claim 88 wherein the crystal obtained 
in (a) is a crystal of IQN17 of space group C222. 

93. A method of producing a drug which binds the N-helix 
coiled-coil pocket of HIV gp41, comprising: 

(a) obtaining the atomic coordinates of IQN1; 

(b) using the atomic coordinates obtained in (a) to define 
the N-helix coiled-coil pocket of HTV gp41; 

(c) identifying a molecule or compound which fits the 
N-helix coiled-coil pocket of HTV gp41; 

(d) obtaining the molecule or compound identified in (c); 
and 
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(e) contacting the molecule or compound obtained in (d) 
with the N-helix coiled-coil pocket of HIV gp41 to 
assess the ability of the molecule or compound to fit the 
pocket of HIV gp41, 

wherein if the molecule or compound fits the N-helix 
coiled-coil pocket of HIV gp41, the molecule or com- 
pound is a drug which fits the pocket, whereby a drug 
which fits the N-helix coiled-coil pocket of HIV gp41 
is produced. 

94. The method of claim 93 wherein the atomic coordi- 
nates are the atomic coordinates in the PDB file represented 
in FIGS. 11A-11V. 

95. A method of identifying a molecule that binds to the 
N-helix coiled-coil cavity of HIV gp41, comprising: 

(a) combining IQN17 in the D-handedness with a bio- 
logically encoded library of ligands, under conditions 
appropriate for binding of members of the library to 
IQN17 in the D-handedness; and 



(b) determining if binding occurs between IQN17 in the 
D-handedness and a member or members of the bio- 
logically encoded library, wherein if binding occurs, a 
ligand that binds to the N-helix coiled-coil cavity of 
HIV gp41 in the D-handedness is identified. 

96. The method of claim 95 further comprising determin- 
ing the sequence of the member or members of the biologi- 
cally encoded library which bind to IQN17 in the D-hand- 
edness, and producing ligands, in the mirror-image 
handedness of the biologically encoded ligands, comprising 
the sequences determined. 

97, The method of claim 95 wherein the biologically 
encoded library is selected from the group consisting of a 
phage display library, a DNA library, an RNA Library and a 
biologically encoded peptide library. 
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